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Abstract 

A model for directed networks is proposed and power laws for their in-degree and/or out-degree distri- 
butions are derived from the model. It is based on the Barabasi- Albert model and contains two parameters. 
The parameters serve as regulatory factors for the contributions of new nodes through in- and out-degrees. 
The model allows a newly added link to connect directly two nodes already present in the network. Such a 
link is called an inner link, while a link attached to a new node is called an outer link. Using relationships 
between inner and outer links, we investigate power laws for in- and out-degree distributions of directed 
networks. This model enables us to predict some interesting features of directed networks; in particular, the 
World Wide Web and the networks of citation and phone-call. 



1. Introduction 

Recent studies of many real-world complex net- 
works in technological and social systems have re- 
vealed that they obey the power law degree distri- 
bution 

p{k) oc fc~ 7 . 

Here p(k) is the probability that a randomly chosen 
node (or vertex) has k links (or edges), and the expo- 
nent 7 is a constant, typically 2 < 7 < 3. 

In order to explain the empirical fact of real net- 
works, Barabasi and Albert proposed a model (BA 
model) in [3]. It could deduce the degree distribution; 
p(k) oc fc~ 3 . Their model is thought of as a model for 
undirected networks. Now we are able to find many 
other network models from which power law degree 
distributions with a variety of exponents can be de- 
duced (see [6, 8, 11] and references therein). 

The objective of this paper is to propose a model 
for directed networks whose in-degree and/or out- 
degree distributions follow power laws. For that pur- 
pose, we first review the model of undirected networks 
introduced in [14]. This model modifies the BA model 
by introducing one parameter and generates a power 
law degree distribution. The parameter regulates the 
growth rate of a network and its value falls between 
one and two, which reflects the obvious fact that 
one link has two nodes. This simple fact leads us to 
2 < 7 < 3. 

Unlike the BA model, the proposed model permits 
old nodes, i.e., nodes already present in the network, 
to connect directly by some of newly added links. We 



call such links inner links, while links attached to a 
new node are called outer links. In BA model new 
links are limited to outer links. 

The Internet and the networks of movie actors or 
phone-call can be modeled by our model, since they 
will grow by acquiring inner links as well as new nodes 
and outer links. On the other hand, citation networks 
cannot contain inner links, because new links are al- 
ways attached to new nodes (i.e., new papers). 

The model presented for directed networks in- 
volves two parameters and it also allows both inner 
links and outer links. The numbers of those links are 
determined by two parameters. The mechanism of 
connections is based on so-called 'preferential attach- 
ment'. It means that the probability that any inner or 
outer directed link attaches to an old node is propor- 
tional to its in-degree or out-degree. Considering both 
incoming and outgoing directions, two power laws of 
degree distributions are derived from the model and 
several relationships between both exponents of power 
laws are discussed. 

We will apply these results to the well-known 
directed networks; the World Wide Web, citation 
networks and phone-call networks. Considering the 
quantity of inner links or outer ones, we attempt to 
predict several rough but interesting features of these 
networks, in particular, their in- and/or out-degree 
distributions. 

A comparison of the ratios of inner links yields a 
conclusion that the WWW can be thought of as an 
intermediate of two extremes; citation networks and 
phone-call networks. 
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Our approach to the study of power laws appear- 
ing in complex networks is simple and seems different 
from others. 



2. Basic model for undirected networks 

We review the basic model for undirected networks 
that was introduced in [14] so as to facilitate the mod- 
eling of directed networks. This simple model provides 
an explanation that degree distributions of real-world 
networks tend to obey power laws with exponents be- 
tween 2 and 3. And it has a desirable feature that 
old nodes in a network are able to connect directly 
through some of newly added links. It is a natural 
phenomenon in real- world networks, while the origi- 
nal BA model ([3, 4]) did not take it into account. 

In order to explain our model for undirected net- 
works, we divide it into the following five parts (i)-(v). 

(i) We start with a network having a small number 
of nodes, say too nodes, and some links. 

(ii) New nodes and new links constantly enter the 
network. Whenever a new node enters the net- 
work, we advance the time step by one and at 
the same time a constant quantity of links, say 
to links, are added to the network. New links 
are cither inner or outer ones. 

Thus new links do not necessarily emanate from 
the new node. The number, m, of new links at each 
time step may be any positive number, since we adopt 
the continuum argument as in [4]. The numbers of 
inner links and outer ones are determined by a pro- 
portionality constant that is related to preferential at- 
tachment. The value Too and the number of the orig- 
inal links are not important, since we eventually take 
the limit with respect to time. 

We denote by Vi the ith node that enters the net- 
work at time step t% and by ki the degree of the node. 
After t time steps, we obtain a network with t + Too 
nodes and mt links plus the original links. Also no- 
tice that the resulting network becomes sparse, since 
the ratio of links present in the network at time step 
t satisfies 

2mt/[(m + t)(m +t- 1)] -> (t -> oo). 

Preferential attachment means that the probabil- 
ity, V , that any node connects to a node v is propor- 
tional to the connectivity k of the node v, so that 



V(k) 



(i) 



the 'rich-get-richer' phenomenon. 

Thus the rate at which a node w, with fc, links ac- 



quires more links is given by 

dki 
~dt 



(2) 



where g is a proportionality constant, which is to be 
determined later. The denominator in {1}, denoted by 



is the total of all degrees in the network at time step 
t. In view of (1), taking the sums over i in both sides 
of (2), we have 



dC(t) 
dt 



9- 



(3) 



Since m links are added to the network in each 
time step and each link is counted twice in counting 
degrees, the total degree C(t) at time step t satisfies 



C(t) w 2mt. 



(4) 



This implies that the growth rate g of the total degree 
is not equal to to as supposed in [3, 4], but approxi- 
mately equal to 2m. The growth rate will be equal to 
m, when one completely neglects the degrees of new 
nodes. On the other hand, it is equal to 2m, when one 
takes a full account of them. 

In order to circumvent this discrepancy, we sepa- 
rate the growth of a network into two sides. One is 
the side of old nodes, i.e., those already present in the 
network, and it is governed by differential equations 
(2) and (3). In other words, it is this side that the 
'preferential attachment' rule is applied to. The other 
is the side of new nodes and it is prescribed by giving 
the initial conditions. These considerations lead us to 
the latter parts of the model. 

(iii) We introduce a parameter (3 into the BA model 
by 9 — (3m. The parameter serves as a regula- 
tory factor for the growing network. 

(iv) We must set the initial condition for a new node 
from (3) rather than simply put fcj(ij) = to. 
The initial condition depends on g and gives the 
number of outer links. 

(v) The occurrence of inner links are allowed at each 
time step, among to new links, as the rest of 
outer links. 

It is reasonable to assume that (3 satisfies 1 < (3 < 
2 from the argument, and one can choose any value of 
(3 in the interval. However, the condition for (3 will be 
reconsidered later and it will be replaced by 1 < (3 < 2. 

Thus Eq. (2), together with (4) and g = (3m, can 
be written as 



dki ki ki 

—— = am- = p—. 

dt 2mt 2t 



(5) 
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As stated in (iv), for a chosen value of /?, we need 
the corresponding initial condition imposed upon a 
new node Vi, which enters the network at time step U. 
Relation (3) implies that g = (3m is the quantity of 
degree's increase for the side of old nodes within one 
time step. On the other hand, 2m is the total quantity 
of degree's increase. From this we see that the initial 
condition is given by 

ki(U) = (2-/?)m, (6) 

for each new node. It gives the number of outer links. 

Using (6) and the Dirac delta function S ([13]), it 
would be better to write the rate equation as 

^ = /?| + (2 - P)mS(t - ti ) (t> ti ), 

instead of (5). Then it is easy to see that relation 
(4) naturally follows by its integration and to realize 
that the first term on the right hand side is really the 
growth rate for Vi as an old node. 

Next let us examine the condition for (3 from (6). 
It necessarily implies (3 < 2 and f3 > 1. If (3 > 2, 
then any proper networks cannot be generated. When 
(3 < 1, there are outer links more than m. Therefore, 
the appropriate condition for the parameter (3 is 

1 </3 <2. 

Moreover, in the case of 1 < (3 < 2, condition (6) 
permits the presence of inner links, which is the con- 
tent of (v). Namely, old nodes themselves present in 
the network are able to connect directly through some 
of new links. Among m newly added ones, there are 
such inner links as many as 

m — (2 — (3)m = (/3 — l)m. 

When (3 = 1 as in [3, 4], this model coincides with the 
original BA model and all of m new links are connec- 
tions between a new node and old nodes. 

In [14], under the condition 1 < (3 < 2, it was 
proved that this modified BA model (i.e., (5) and (6)) 
generates a network whose degree distribution follows 
a power law with exponent 

„ 2 

So we obtain 2 < 7 < 3 from our model. 

The higher the ratio of inner links (i.e., (3 — 1 > 
2-/3 and the difference of both sides becomes larger) , 
the more 7 approaches 2.0 from 3.0. Table 1 due to 
[8, p. 80] exhibits undirected networks with lower 7 (or 
larger (3). Our intuition tells us that those networks 
grow by acquiring inner links as well as outer ones and 
new nodes. 



Table 1 



Network 


7 09) 


movie actors 


2.3 (1.5) 


Internet (domain) 


2.2 (1.7) 


Internet (router) 


2.3-2.5 (1.3-1.5) 



3. Directed networks 

The framework of the modeling for directed net- 
works with power law degree distributions is similar 
to that of undirected networks. Based on the five pro- 
cedures (i)-(v) of Section 2 we deduce two degree dis- 
tributions for a directed network, considering both in- 
coming and outgoing directions. 

As before, at each time step, one new node and 
m new directed links are added to the network. This 
model also allows presence of inner links connecting di- 
rectly old nodes that already exist in the network. The 
mechanism of preferential attachment is used when in- 
ner or outer directed links attach to old nodes is pro- 
portional to their in-degrees or out-degrees. 

Let us denote by ki the in-degree of ith node Vi. 
First we consider the in-degree distribution and as- 
sume that preferential attachment can be applied to 
in-degrees. If we denote by C(t) the total in-degree of 
the network at time step t, then C{t) « mt instead of 
(4). Hence, using (iii) and (iv) of Section 2, the rate 
equation (5) and the initial condition (6) should be 
replaced by 

¥ =am ^ = °? ki(ti) = (1 - a)m, (7) 

respectively. Here the parameter a regulates the 
growth rate of network's in-degrees and < a < 1. 
Remark that am is a proportionality constant that is 
related to preferential attachment. It is the quantity 
of in-degree's increase within one time step for the side 
of old nodes. Alternatively, using the delta function, 
one can write them in a single equation as 

dk k 

-±=a-j- + (l-a)mS(t-ti) (t > U). 

Employing a similar argument to the one in [14], 
we get from (7) 

p(jfe) oc fc-a+V") 

for the in-degree distribution. In what follows the 
derivation of this is given for the sake of complete- 
ness. For another proof using the master equations 
see [8, 14]. 

The solution of Eq. (7) is given by 

fci(t) = (l-a)mW (*>ti), 
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where U is the time when a node Vi enters the network. 
It is assumed that the time interval s = tj+i — (for 
each i) is a constant, usually s = 1. 

In order to obtain the in-degree distribution for 
our model, we could adopt the method employed in 
[3, 4]. However, we will use the following alternative 
approach, because it is easier to follow and, moreover, 
it gives the same asymptotic behavior for the in-degree 
distribution as the method in [3] or [4]. 

To deduce the probability p(k) that a randomly 
chosen node v has an in-degree k, it suffices to es- 
timate the number of time steps t^s for nodes Vi's, 
which satisfy the inequalities 



k - 1 < h{t) = (1 - a)m 



< k. 



(8) 



This can be written as 



t 



-i -l/a 



(1 — a)m 



<U<t 



k- 1 
(1 — a)m 



-i -l/a 



In the case of s = 1, the number of nodes satisfying 
(8) is equal to the difference of both sides of the above 
inequalities: 

t{{\ - a)m} 1 ' a {(l - l/fc)~ 1/Q - 

Making use of (1 — l/fc) -1 /" « 1 + a~ x /k for large 
k, we see that the number of nodes satisfying (8) is 
approximated by 



/a 



Since at time t there exist t+mo nodes, the probability 
p(k) is thus given by 



fc- 1 - 1 /". 



t + m 

In the limit t — > oo we conclude that 

p(k) oc k- (1+1 / a \ 

for all large k, i.e., 7i n = 1 + l/a. 

Next we assume that preferential attachment can 
be applied to out-degrees and denote by ii the out- 
degree of the ith node. Solving a similar equation to 
(7) for ti 



dl 



= (3m— = fi- 
at mt t 



£i(ti) = (1 - /3)m, 



(9) 



leads to another power law for the out-degree distri- 
bution 

p(/)«r( 1+1 /«. 

Here the parameter f3 regulates the growth rate of net- 
work's out-degrees and < (3 < 1. 



In summary the in- and out-degree distributions 
of directed networks follow the power laws with expo- 
nents 



1 1 

7;„ = 1 + -, 7out = l + -j- 
a p 



(10) 



Using the initial conditions of (7) and (9), among 
m newly added links, there are outer links as many as 



while there are 



(2 - a - /3)m, 



(a + (3- l)m 



inner links, i.e., not attached to a new node, as stated 
in (v) of Section 2. When a + j3 = 2, note that any 
proper networks are not generated. Both numbers of 
links entail the inequalities 



1 < a + f3 < 2. 



(11) 



When a ;$ 1 (i.e., a is close to and less than 1), 
(10) implies that the higher the ratio of inner links 
(namely, a + f3 — 1 > 2 — a — (3 and the difference 
between both sides becomes larger), the more the ex- 
ponent 7 out approaches 2.0 from 3.0. This remark will 
be used in the next section. 

4. The WWW and the networks of 
citation and phone-call 

In this section we deal with three kinds of directed 
networks with power law degree distributions; the 
World Wide Web, citation networks and phone-call 
networks. Table 2 due to [5, p. 72] or [8, p. 80] exhibits 
exponents of their power law in- and/or out-degree 
distributions. 



Table 2 



Network (reference) 


7in (a) 


7out (0) 


WWW ([9]) 


2.1 (0.9) 


2.38 (0.7) 


WWW ([2]) 


2.1 (0.9) 


2.45 (0.7) 


WWW ([7]) 


2.1 (0.9) 


2.72 (0.6) 


citation ([12]) 


3.0 (0.5) 


-(o) 


citation ([15]) 


2.0 (1.0) 


-(o) 


phone-call ([1]) 


2.1 (0.9) 


2.1 (0.9) 



We apply the results of the previous section sep- 
arately to each of those networks. Using Table 2, we 
compare the power law exponents of their in- and/or 
out-degree distributions with those that are deduced 
from the model. For that purpose we examine the 
quantity or ratio of inner or outer links in new di- 
rected links and the initial conditions for incoming 
and/or outgoing links. 
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It will be shown that the ratios of inner links of the 
WWW, a citation network and a phone-call network 
are approximately equal to 0.5, and 1.0, respectively. 
A conclusion drawn from the observations is that the 
WWW can be thought of as an intermediate of two 
extremes; citation networks and phone-call networks. 

A. The WWW 

The World Wide Web (WWW) is a directed net- 
work with nodes (Web pages) and hyperlinks. Since 
its first appearance in the early 1990s, the WWW 
has been extensively studied, beginning with seminal 
works [2, 3, 7] among others. It is assumed that pref- 
erential attachment can be applied to both in- and 
out-degrees. So, the WWW has power law in- and 
out-degree distributions and other interesting proper- 
ties. 

Due to [8] or [10], in May 1999 the WWW con- 
tained 203 x 10 6 nodes and 1466 x 10 6 hyperlinks, 
and in October 1999 there were 271 x 10 6 nodes and 
2130 x 10 6 hyperlinks. This data suggests a large 
amount of inner links. 

In the WWW the initial incoming links of most 
nodes seem relatively rare. So, setting a ^ 1 in (7), 
we obtain the inequality 

7in £2 

from (10). The model explains the reason why the 
exponents 7i„ of the in-dcgrcc distributions for the 
WWW are close to 2.0 as Table 2 shows. 

On the other hand, most new nodes of the WWW 
have more outgoing links than incoming ones. So, 
using the initial conditions in (7) and (9), we have 

1 — a < 1 — (3, and hence generally the inequality 

7in < 7out 

also holds from (10). Table 2 tells us that this inequal- 
ity together with (11) is satisfied in the WWW. 

The assumption that the number of inner links is 
larger than that of outer ones is equivalent to the in- 
equality 

7out < 3. 

This immediately follows from the fact that two rela- 
tions 

a + /3-l>2-a-/3, a < 1, 

yield /3 > 0.5 and 7 out < 3, and vice versa. Table 

2 suggests that such an assumption is effective and 
that a + (3 ^ 1.5 holds in the WWW, which implies 
(a + [3 — l)m > (2 — a — [3)m. Therefore, the number 
of inner links, among m new directed links, tends to 
be larger than that of outer links, but the difference 
does not become so large. 



B. Citation network 

A similar argument may be employed in a citation 
network of academic papers. The studies of citation 
networks have a long history than the WWW (see [6, 
11]). Suppose preferential attachment can be applied 
to cited papers; another 'rich-get-richer' phenomenon. 
For the same reason as above, we can set a £ 1 in 
(7) for the initial condition of new papers. Thus the 
in-degree distribution for a citation network turns out 
to obey a power law with exponent 7; n ^ 2. This 
result is in favor of [15] rather than [12]; 7; n m 3. 

On the other hand, we must set (3 ^ 0, since 
there are no inner links in a citation network. How- 
ever, we cannot apply (10) for 7 out , because we do 
not know whether preferential attachment is valid for 
out-degrees in a citation network. 

C. Phone-call network 

As for the in-degree distribution of a phone-call 
network, Table 2 also exhibits the exponent with lower 
7in and hence a ^ 1. The reason is the same as a ci- 
tation network. 

In contrast to a citation network or the WWW, 
however, we guess that there are much more inner 
links than outer ones in a phone-call network. It im- 
plies (3 ;$ 1 from the remark at the end of Section 3 
and this can be verified by Table 2, too. If preferential 
attachment is also valid for out-degrees in a phone- 
call network, then the out-degree distribution obeys a 
power law and its exponent 7 ou t tends to be close to 
2.0, as reported in an early paper [1]. 
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